搜索资源列表
ansj
- ictclass的java版本开源实现,用于实现中文分词功能。-the java version that the open source implementation of ictclass
ansj_seg20121123
- java分词实现,可以自己定义词库-java fenci,upload user library
IKAnalyzer2012
- IKAnalyzer是一个开源的,基于java语言开发的轻量级的中文分词工具包。-IKAnalyzer is an open source, based on the the lightweight java language development of Chinese word Kit.
Segment
- java实现的分词操作,可用于将一句话按照汉语习惯分成对应词-java achieve segmentation operation, can be divided into the corresponding word sentence in accordance with the Chinese habit
fencizf
- 基于java编程,采用最大匹配算法实现简单的中文分词,并过滤停用词-The maximum matching algorithm based on java programming, simple Chinese word segmentation and filtering stop words
CWSSFenci
- java基于字典的分词,字典存储结构采用Hash表,并和Lucene的token流接口相结合,可以再lucene中使用-Hash tables java dictionary-based segmentation, dictionary storage structure and lucene in use and Lucene token stream interface combined
hlssplit
- hlssplit分词系统,使用C++编写,提供java接口,非常好用的分词工具,严重推荐!-the hlssplit word system written using the C++, java interface, very easy to use segmentation tools seriously recommend!
com
- 使用java语言开发的分词器源码,可结合luncene使用,效果很好-Word segmentation is developed using java language, can be used in conjunction with the luncene, the effect is very good
IKAnalyzer3.2.0Stable_bin
- IKAnalyzer是一个开源的,基于java语言开发的轻量级的中文分词工具包。从2006年12月推出1.0版开始,IKAnalyzer已经推出了3个大版本。最初,它是以开源项目Luence为应用主体的,结合词典分词和文法分析算法的中文分词组件。新版本的IKAnalyzer3.0则发展为面向Java的公用分词组件,独立于Lucene项目,同时提供了对Lucene的默认优化实现。 -IKAnalyzer is an open source toolkit, Chinese word segm
ICTCLAS50
- 基于中科院分词作的java分词工具,内容详细各个函数都有实现,内含有word解析文档-For java-based CAS-word segmentation tool, details of each function has to achieve, which contains the word parse the document
chinese-analyzer
- 基于中科院的分词系统修改的java版的中文分词系统-CAS-term system based on the modified version of java Chinese word segmentation system
Win-32bit-JNI-lib
- java实现的NLPIR汉语分词系统源代码-java implementation NLPIR Chinese word segmentation system source code
nlu_project
- 采用机器学习的方法进行自然语言处理,对中文进行分词和词性标注。分词采用crf模型,词性标注用hmm模型,解码算法为Vertibi算法。本系统使用java语言编写-Using machine learning methods for natural language processing, carried out on the Chinese word segmentation and POS tagging. Segmentation using crf model, tagging with
Ictclas
- 中文分词的java实现实例。包括词性标注和分词等功能。-Chinese word segmentation to achieve the java instance. Including word tagging and other functions.
ICTCLAS50_Windows_32_JNI
- ICTCLAS50_Windows_32_JNI-分词工具是中科院研究开发的一款很实用的java切词工具!-ICTCLAS50_Windows_32_JNI-word research tool is the development of a CAS is very useful java segmentation tool!
AnalyzerTest
- java中文分词lucene,可以实现中英文分词功能,查询功能!-Chinese word java lucene, can be achieved in the English word function, search function!
split_words
- 分词程序,正向最大匹配法,JAVA语言。核心思想是从句子最左端开始,单字节扫描匹配,直至句末-Segmentation procedure, forward maximum matching, JAVA language. Core idea is to start from the leftmost sentence, single-byte scan match until end of the sentence
jcseg-1.9.0-src-jar-dict
- java中英文分词工具源码包,分词准确,性能快.-jcseg-1.9.0-java source
0nlu_project
- 本系统使用java语言编写,采用机器学习的方法进行自然语言处理,对中文进行分词和词性标注。分词采用crf模型,词性标注用hmm模型,解码算法为Vertibi算法。-The system uses java language, using machine learning methods for natural language processing, for Chinese word segmentation and POS tagging. Segmentation using crf mod
chinese-analyzer
- java开发的的分词系统修改的的中文分词系统-java development of the sub-system changes the Chinese word segmentation system